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Degraded Broadcast Channel with Side 
Information, Confidential Messages and with or 
without Noiseless Feedback 

Bin Dai, A. J. Han Vinck, Fellow, IEEE, Zhuojun Zhuang, and Yuan Luo 

Abstract 

In this paper, first, we investigate the model of degraded broadcast channel with side information and confidential 
messages. This work is from Steinberg's work on the degraded broadcast channel with causal and noncausal side 
information, and Csiszdr-Korner's work on broadcast channel with confidential messages. Inner and outer bounds 
on the capacity-equivocation regions are provided for the noncausal and causal cases. Superposition coding and 
double-binning technique are used in the corresponding achievability proofs. 

Then, we investigate the degraded broadcast channel with side information, confidential messages and noiseless 
feedback. The noiseless feedback is from the non-degraded receiver to the channel encoder. Inner and outer bounds 
on the capacity-equivocation region are provided for the noncausal case, and the capacity-equivocation region is 
determined for the causal case. Compared with the model without feedback, we find that the noiseless feedback helps 
to enlarge the inner bounds for both causal and noncausal cases. In the achievability proof of the feedback model, 
the noiseless feedback is used as a secret key shared by the non-degraded receiver and the transmitter, and therefore, 
the code construction for the feedback model is a combination of superposition coding, Gel'fand-Pinsker's binning, 
block Markov coding and Ahlswede-Cai's secret key on the feedback system. 

Index Terms 

Confidential message, capacity-equivocation region, degraded broadcast channel, noiseless feedback, secrecy 
capacity, side information. 

I. Introduction 

Equivocation was first introduced into channel coding by Wyner in his study of wiretap channel fl31 . It is a kind 
of degraded broadcast channels. The object is to transmit messages to the legitimate receiver, while keeping the 
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wiretapper as ignorant of the messages as possible. After the publication of Wyner's work, Csiszdr and Korner fH 
investigated a more general situation: the broadcast channels with confidential messages, see Figure [T] The model 
of |4j is to transmit confidential messages to receiver 1 at rate R\ and common messages to both receivers at 
rate R , while keeping receiver 2 as ignorant of the confidential messages as possible. Measuring ignorance by 
equivocation, a single-letter characterization of all the achievable triples (R\, R £ ,Rq) was provided in [4|, where R e 
is the second receiver's equivocation to the confidential messages. Note that the model of (4) is also a generalization 
of [8 1, where no confidentiality condition is imposed. In addition, Merhav lITTI studied a specified wiretap channel, 
and obtained the capacity region, where both the legitimate receiver and the wiretapper have access to some leaked 
symbols from the source, but the channels for the wiretapper are more noisy than the legitimate receiver, which 
shares a secret key with the encoder. 
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Fig. 1: Broadcast channels with confidential messages 



In communication systems there is often a feedback link from the receiver to the transmitter^. g. the two-way 
channels for telephone connections. It is well known that feedback does not increase the capacity of discrete 
memoryless channel (DMC). However, does the feedback increase the capacity region of the wiretap channel? In 
order to solve this problem, Ahlswede and Cai studied the general wiretap channel (the wiretap channel does not 
need to be degraded) with noiseless feedback from the legitimate receiver Q), ar >d both upper and lower bounds 
of the secrecy capacity were provided. Specifically, for the degraded case, they showed that the secrecy capacity 
is larger than that of Wyner's wiretap channel (without feedback). In the achievability proof, Ahlswede and Cai 
H] used the noiseless feedback as a secret key shared by the transmitter and the legitimate receiver, while the 
wiretapper had no additional knowledge about the key except his own received symbols. Besides Ahlswede and 
Cai's work, the wiretap channel with noisy feedback was studied in |10|, and the wiretap channel with secure 
rate-limited feedback was studied in J3, and both of them focused on bounds of the secrecy capacity. 

The coding for channels with causal side information at the encoder was first investigated by Shannon |fT3l in 
1958. After that, in order to solve the problem of coding for a computer memory with defective cells, Kuznetsov and 
Tsybakov |9| considered a channel in the presence of noncausal side information at the transmitter. They provided 
some coding techniques without determination of the capacity. The capacity was found in 1980 by Gel'fand and 
Pinsker (7). Furthermore, Costa |6) investigated a power-constrained additive noise channel, where part of the 
noise is known at the transmitter as side information. This channel is also called dirty paper channel. Based on 
the dirty paper channel, C. Mitrpant et al. Ifl2l studied the Gaussian wiretap channel with side information, and 
provided an inner bound on the capacity-equivocation region. Furthermore, Y. Chen et al. [3] investigated the 
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discrete memoryless wiretap channel with noncausal side information, and also provided an inner bound on the 
capacity-equivocation region. Note that the coding scheme of [3] is a combination of those in Q, ifTSl . In order 
to introduce side information to the broadcast channel, Steinberg investigated the degraded broadcast channel with 
side information lfl4ll . where both causal and noncausal side information were considered in his paper. Specifically, 
inner and outer bounds on capacity region were provided for the degraded broadcast channel with noncausal side 
information lfl4ll . and meanwhile, the capacity region of the degraded broadcast channel with causal side information 
was totally determined [14]. 

In this paper, we study the model of degraded broadcast channel with side information, confidential messages and 
with or without noiseless feedback (see Figure [2}. The model of this paper is from Steinberg's work on degraded 
broadcast channel with side information at the encoder, and the model of broadcast channel with confidential 
messages provided by Csiszdr and Korner. In Figure [2] S is the confidential message sent to receiver 1, and T is 
the common message sent to both receiver 1 and receiver 2. The transition probability of channel 1 depends on the 
side information V N , and V N is available to the channel encoder in a causal or noncausal manner. Receiver 2 can 
get a degraded version of Y N via channel 2. In addition, there may exist a noiseless feedback from the output of 
channel 1 to the channel encoder. 



(causal or noncausal manner to the channel encoder) 
V N • 



(S,T) 



Channel encoder! 



Channel 1 



Transmitter 



noiseless feedback Y x 1 
at time i 



Channel 2 



Receiver 1 

•Receiver 2 



rN 



Fig. 2: Degraded broadcast channel with side information, confidential messages and with or without noiseless 
feedback 



In this paper, random variables, sample values and alphabets are denoted by capital letters, lower case letters 
and calligraphic letters, respectively. A similar convention is applied to the random vectors and their sample values. 
For example, U N denotes a random TV-vector (Ui, Un), and u N = (u±, ...,un) is a specific vector value 
in U N that is the Nth Cartesian power of U. Uf denotes a random N - i + 1-vector (£/*, Un), and 
uf = (iii, ...,-Ujv) is a specific vector value in Uf . Let pv(v) denote the probability mass function Pr{V = v}. 
Throughout the paper, the logarithmic function is to the base 2. 

The organization of this paper is as follows. In Section [II] inner and outer bounds on the capacity-equivocation 
regions of the model of Figure [2] without feedback are provided by Theorem [T] to Theorem [4] Specifically, the inner 
and outer bounds for the model of Figure [2] with noncausal side information and without feedback are provided in 
Theorem [T] and Theorem [2] respectively. The inner and outer bounds for the model with causal side information and 
without feedback are provided in Theorem [5] and Theorem [4] respectively. In Section III inner and outer bounds on 
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the capacity-equivocation region of the model of Figure [2] with noncausal side information and noiseless feedback 
are provided in Theorem [5] and Theorem [6] respectively. The capacity-equivocation region of the model with causal 
side information and noiseless feedback is given in Theorem [7] Section [IV] is for a binary example about the model 
of Figure [2] with causal side information and noiseless feedback. Final conclusions are provided in Section [Vj 

II. The Model of Figure [2] without Feedback 

In this section, the model of Figure [2] without feedback is considered into two parts. The model of Figure [2] with 
noncausal side information and without feedback is described in Subsection |II-A| and the model of Figure [2] with 



causal side information and without feedback is described in Subsection II-B see the following 



A. The Model of Figure [2] with Noncausal Side Information and without Feedback 

In this subsection, a description of the model of Figure [2] with noncausal side information and without feedback is 
given by Definition[T]to Definition[5] The inner and outer bounds on the capacity-equivocation region composed 
of all achievable (i?o, Ri, R e ) triples are in Theorem [T] and Theorem [2] where the achievable (i?o, R\, Re) triple 
is defined in Definition [6] 

Definition 1: (Channel encoder) The confidential message S takes values in S, and the common message T 
takes values in T. S and T are independent and uniformly distributed over their ranges. V N is the side information 
of channel 1, and it is the output of a discrete memory less source pv(v). In addition, V N is available to the channel 
encoder in a noncausal manner. Note that V N is independent of S and T. At the i-th time, the inputs of the channel 
encoder are S, T and V N , while the output is X i7 i.e., the channel encoder is a mapping 

fr-.S xTxV N ^ (2.1) 

where /j(s, t, v N ) = Xi G X, s G S, t G T and v N G V N . The transmission rates of the confidential message and 
the common message are ^j^H and 1 ° s |/ r ^ , respectively. 

Definition 2: (Channel 1) Channel 1 is a DMC with finite input alphabet X x V, finite output alphabet y, and 
transition probability Qi(y\x, v), where x £ X, v G V, y G y. Qi(y N \x N , v N ) = Y[ r >-1 Ql(Vn\xm v n)- The inputs 
of Channel 1 are X N and V , while the output is Y N . 

Definition 3: (Channel 2) Channel 2 is a DMC with finite input alphabet y, finite output alphabet Z, and 
transition probability Q 2 (z\y), where y G y, z G Z. Qi{z N \y N } — W n -\ Qi{z n \y^). The inputs of Channel 2 is 
Y N , while the output is Z N . Receiver 2's equivocation to the confidential message is defined as 

A = ^H{S\Z N ). (2.2) 

The cascade of Channel 1 and Channel 2 is another DMC with transition probability 

Q 3 (z\x, v) = ^2 Qw{z\v)Qm{v\ x , v )- (2-3) 

Definition 4: (Decoder 1) Decoder 1 is a mapping fni ■ y N — >• S x T, with input Y N and outputs S,T. Let 
P e i be the error probability of receiver 1 , and it is defined as Pr{(S, T) ^ (5, T)}. 
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Definition 5: (Decoder 2) Decoder 2 is a mapping fjj 2 ■ Z N — > T, with input Z N and output T. Let P e 2 be 
the error probability of receiver 2 , and it is defined as Pr{T =^ T}. 

Definition 6: (Achievable (Rq, Ri,R e ) triple in the model of Figure [2] with noncausal side information and 
without feedback) A triple (R , R 1 , R e ) (where R 0l Ri,R e > 0) is called achievable if, for any e > (where e is 
an arbitrary small positive real number and e 0), there exists a channel encoder-decoder (N, A, P e x, P^) such 
that 

log II T || = Um lQ g II S || = Hm A ^ ^ £ _ 

JV->co A AT->oo A N^too 

The following Theorem [T] and Theorem [2] provide inner and outer bounds on the capacity-equivocation region 
for the model of Figure |] with noncausal side information and without feedback, and they are proved in 
Appendix |A| and Appendix |B| 

Theorem 1: A single-letter characterization of the region is as follows, 

TZ^ = {(Ro,Ri,R e ) :0<R e <Ri, 
R <I(U;Z)-I(U;V), 
Ri<I(K;Y\U)-I(K;V\U), 
R e <I(K;Y\U)-I(K;Z\U)}, 
where p UKVX Yz{u,k,v,x,y, z) = p Z \ Y (z\y)PY\x,v{y\x,v)PUKXv(u,k,x,v), which implies that (U,K) -t 

(x,v) ->y -> 

The region ^("^ satisfies C 7?,^. 

Remark 1: There are some notes on Theorem [T] see the following. 

• The region is convex, and the proof is directly obtained by introducing a time sharing random variable 
into Theorem [T] and therefore, we omit the proof here. 

• The ranges of the random variables U and K satisfy 

IM<||*||||V||+2, 
||/C||<(||A-||||V||+2) 2 . 

The proof is in Appendix [C] 

• Without the secrecy parameter R e , the region TZ( m *> is exactly the same as the achievable region for the 
degraded broadcast channel with noncausal side information [14|. 

• The points in TZ^ for which R e = R\ are of considerable interest, which imply H(S) = H(S\Z N ). 
Definition 7: (The secrecy capacity ci™^) The secrecy capacity ci™^ of the model of Figure [2] with noncausal 
side information and without feedback, is denoted by 

Ci n) = max R x . (2.5) 

(R o =0,Ri,R e =Ri)en(^ 



Furthermore, the secrecy capacity Cg satisfies 

Ci Tl) >maxmin{I(K;Y\U)-I(K;Z\U),I{K;Y\U)-I{K;V\U)}. (2.6) 
Proof of (2.6): Substituting R e — R\ and Rq = into the region 

TZ(m) in Theorem [I] we have 
Ri < I(K;Y\U) - I(K;V\U), (2.7) 
Ri < I(K;Y\U)-I(K;Z\U). (2.8) 

(2.9) 

Note that the triple (R = 0, R x = niaxmin{/(if; Y\U) - I(K; Z\U), I(K; Y\U) - I(K;V\U)}, R e = Ri) 
is achievable, and therefore, the secrecy capacity Cs > raaxmin{/(if ; Y\U) — I(K; Z\U), I(K;Y\U) — 
I(K; V\U)}. Thus the proof is completed. ■ 
Theorem 2: A single-letter characterization of the region TZ^'' 10 ^ is as follows, 

={(R o ,R 1 ,R e );0<R e <R 1 , 

Ro<I(U;Z)-I(U;V), 

R 1 < I(K;Y\U,A) - I(K;V\U,A), 

Ro + Ri <I(U,A,K;Y)-I(U,A,K;V), 

Re < I(K; Y\U, A) - I{K; V\U, A) - I[K; Z\U) + I(K; V\U)}, 
where pukavxyz{u, k, a, v, x, y, z) = Pz\y(z\u)py\x,v(u\x, v)p UKA xv(u, fc, a, x, v), which implies that (J7, K, A) 

(x,v) -)• y z. 

The region Tl^ satisfies 72, W c K [no \ 

Remark 2: There are some notes on Theorem [2] see the following. 

• The region 

n {no) 

is convex, and the proof is directly obtained by introducing a time sharing random variable 
into Theorem [2] and therefore, we omit the proof here. 

• The ranges of the random variables U and K satisfy 

l|W||<ll*lll|V||+2, 
M||<(||A"||||V||+2)(||*||||V||+1), 
||/C||<(P||||V||+2)(||A-||||V||+l)(||^||||V||+2) 2 . 

The proof is similar to the proof of the second part of Remark [T] and it is omitted here. 

• Without the secrecy parameter R e , the region 7?/™°) is exactly the same as the outer bound for the degraded 
broadcast channel with noncausal side information [14|. 
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B. The Model of Figure [2] with Causal Side Information and without Feedback 

The model of Figure [2] with causal side information and without feedback is similar to the model in Subsection 
except that the side information V N in Definition [l] is known to the channel encoder in a causal manner, i.e., 



H-A 



at the i-th time (1 < i < N), the output of the channel encoder xi = fi(s,t,v l ), where v l — (vi,V2, ■■■,Vi) and /j 
is the mapping at time i. Note that V t is independent of (Y i_1 , T, V^, Z 1 ^ 1 ), where = {V i+ i, V i+2l V N ), 
Y*- 1 = (Yi,Y 2 , Yj_i) and Z^ 1 = {Z 1 ,Z 2 ,...,Z i _ 1 ). 

The following Theorem [3] and Theorem |4] provide inner and outer bounds on the capacity-equivocation region 
TZ^ for the model of Figure |5] with causal side information and without feedback, and they are proved in Appendix 
|D] and Appendix [E] 

Theorem 3: A single-letter characterization of the region is as follows, 

K {a) ={(R ,R 1 ,R e ) :0<R e <Ri, 

R <I(U;Z), 

Ri<I(K;Y\U), 

R e <I{K;Y\U)-I(K;Z\U)}, 

where p UKVX Yz(u,k,v,x,y,z) = Pz\Y(z\y)PY\x,v(y\x,v)p x \ UKV (x\u,k,v)p UK (u,k)pv(v), which implies 
that (U, K) -)• (X, V) -)• Y -> Z and V is independent of U and K. 
The region satisfies 7?.( C4 ) C /^.( c ). 

Remark 3: There are some notes on Theorem [3] see the following. 

• The region is convex, and the proof is omitted here. 

• The ranges of the random variables U and K satisfy 

||W||< ||*|| ||V||+1, 
||/C||<(||A-||||V|| + 1) 2 . 

The proof is similar to the proof of the second part of Remark [T] and it is omitted here. 

• Without the secrecy parameter R e , the region TZ^ is exactly the same as the capacity region for the degraded 
broadcast channel with causal side information IT4l . 

• The points in TZ^ for which R e = R\ are of considerable interest, which imply H{S) = H(S\Z N ). 
Definition 8: (The secrecy capacity ci*^) The secrecy capacity ci^ of the model of Figure [2] with causal 
side information and without feedback, is denoted by 

Ci c) = max Ri. (2.10) 

(c) 

Furthermore, the secrecy capacity Cs satisfies 

Ci c) > max(I(K;Y\U) - I{K;Z\U)). (2.11) 
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Proof of (2.11\: Substituting R e = Ri and R = into the region TZ^ in Theorem [3] we have 

Ei < I(K;Y\U), 

Ri < I(K;Y\U)-I(K;Z\U). 

Note that the triple (R = 0, i?i = max^ii'; Y"|Z7) - I(K; Z\U)),R e = R\) is achievable, and therefore, the 
secrecy capacity cj, > max(I(K; Y \U) — I(K; Z\U)). Thus the proof is completed. ■ 
Theorem 4: A single-letter characterization of the region is as follows, 

K {co) ={(R ,R 1 ,R e ) :0<R e < R u 

Ro<I{U;Z), 

Rx<I{K-Y\U), 

R e <I{K;Y\U)-I(K;Z\A)}, 

where pukavxyz(u, k, a, v, x, y, z) = Pz\y{z\v)py\x,v{v\x, v)p X \uAKv(x\u, a, k, v)p UK a(u, k, a)p v {v), which 
implies that ({/, K, A) -> (X, V) -> Y -> Z and V is independent of U, K and A. 
The region satisfies C K {co \ 

Remark 4: There are some notes on Theorem |4] see the following. 

• The region is convex, and the proof is omitted here. 

• The ranges of the random variables U and K satisfy 

IM<||*||||v|| + i, 

\\A\\<\\X\\\\V\l 

\\ic\\<{\\x\\\\v\\ + m\x\\\\vi 

The proof is similar to the proof of the second part of Remark [T] and it is omitted here. 

• Without the secrecy parameter R e , the region is exactly the same as the capacity region of the degraded 
broadcast channel with causal side information [14|. 

III. The Model of Figure[2]with Feedback 

In this section, the model of Figure [2] with feedback is considered into two parts. The model of Figure [2] with 
noncausal side information and feedback is described in Subsection |III-A| and the model of Figure [2] with causal 



side information and feedback is described in Subsection III-B see the following. 



A. The Model of Figure [2] with Noncausal Side Information and Feedback 

The model of Figure [2] with noncausal side information and feedback is similar to the model in Subsection 
except that there is a noiseless feedback from receiver 1 to the channel encoder. The feedback Y 1 ^ 1 (where 



H-A 



2 < i < N and Y l 1 takes values in y l x ) is the previous i — 1 time output of channel 1. At the i-th time, 
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the inputs of the channel encoder are S, T, Y l 1 and V , while the output is X i7 i.e., the channel encoder is a 
mapping 

fi -.SxTxy 1 - 1 x V N ^ X u (3.12) 

where f^s, t, y*' 1 , v N ) = x. t e X, s eS,t € T, y^ 1 € J^ 1 and v N e V N . 

The following Theorem [5] and Theorem [6] provide inner and outer bounds on the capacity-equivocation region 

fcinf) for the model of p igure U 

with noncausal side information and feedback, and they are proved in Appendix 

|F|and Appendix [G] 

Theorem 5: A single-letter characterization of the region 

Kinfi) 

is as follows, 

U (nfi) = { ( RQjRliRe ) :0<R e <R u 

Ro<I(U;Z)-I(U;V), 

iZi <I(K-Y\U)-I{K-V\U), 

R e < H(Y\Z)}, 

where p UKVX Yz{u,k,v,x,y, z) = p z \ Y (z\y)p Y \x,v{y\x,v)PUKXv(u,k,x,v), which implies that (U,K) -t 

(x,v) ->y -> z. 

The region ftfr-W satisfies C ftW). 

Remark 5: There are some notes on Theorem [5] see the following. 

• The region is convex, and the proof is omitted here. 

• The ranges of the random variables U and K satisfy 

l|W||<||*||||V||+2, 
||/C||<(||Af||||V||+2)(P||||V|| + l). 

The proof is similar to the proof of the second part of Remark [T] and it is omitted here. 

• Without the secrecy parameter R e , the region TZ.( n fv is exactly the same as the achievable region for the 
degraded broadcast channel with noncausal side information [14|. 

• The points in TZ^' for which R e — Ri are of considerable interest, which imply H(S) = H(S\Z N ). 
Definition 9: (The secrecy capacity Ci"^" 1 ) The secrecy capacity ci"^ of the model of Figure [2] with 
noncausal side information and feedback, is denoted by 

C { s nf) = max (3.13) 

Furthermore, the secrecy capacity Cs' satisfies 

Ci nf) > max.mhi{H(Y\Z),I(K;Y\U) - I(K;V\U)}. (3.14) 
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Proof of (3.14): Substituting R e = Ri and R Q — into the region 1Z ("-^ in Theorem [5] we have 

R 1 < I{K;Y\U) - I(K;V\U), 
Ri < H(Y\Z). 



Note that the triple (R = 0,R 1 = maxmin{77 (Y\Z), I(K; Y \U) - I(K;V\U)},R e = R x ) is achievable, 
and therefore, the secrecy capacity ci™^ > maxmhi{H(Y\Z),I(K; Y\U) — I(K;V\U)}. Thus the proof is 
completed. ■ 
Note that the formula R e < I(K; Y\U) - I(K; Z\U) of Theorem [T] can be bounded as follows. 

Re < I{K-Y\U)-I{K-Z\U) 
= H{K\U,Z)-H{K\U,Y) 
( = 5 H(K\U,Z)-H(K\U,Y,Z) 

= I(K] Y\U, Z) < H(Y\U, Z) < H{Y\Z), (3.15) 
where (a) is from K — > (U, Y) — s- Z. 

Formula (3.15 1 implies that the feedback helps to enlarge the region 

ftM in Theorem [I] 



Theorem 6: A single-letter characterization of the region 

Uinfo) 

is as follows, 

n (nfo) = {(R OjRuRe ) :0 < Re < Rlj 

Ro<I(U;Z)-I(U;V), 

Ri < I(K;Y\U,A) - I(K;V\U,A), 

Ro + Ri< I{U, A, K; Y) - 7(17, A, K; V), 

R e < H{Y\Z)}, 

where pukavxyz{u, k, a, v, x, y, z) = Pz\y{z\v)py\x,v{v\x, v)p UKA xv(u, fc, a, x, v), which implies that (J7, K, A) 
(X, V)^Y Z. 
The region K inf °) satisfies H^ nf > C lZ^ nfo \ 

Remark 6: There are some notes on Theorem [6] see the following. 

• The region is convex, and the proof is omitted here. 

• The ranges of the random variables U and K satisfy 

IM<ll*lll|V||+2, 
MI|<(||Af||||V||+2)(||^||||V||+l), 
||/C||<(p||||V||+2)(||A-||||V||+l)(p||||V||+2) 2 . 
The proof is similar to the proof of the second part of Remark [T] and it is omitted here. 
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• Without the secrecy parameter R e , the region 

K(nfo) is exact i y the 

same as the outer bound for the degraded 

broadcast channel with noncausal side information [14|. 

B. The Model of Figure [2] with Causal Side Information and Feedback 

The model of Figure [2] with causal side information and feedback is similar to the model in Subsection |III-A| 
except that the side information V N is known to the channel encoder in a causal manner, i.e., at the z-th time 
(1 < i < N), the output of the channel encoder Xi = fi(s,t,y' l ~ 1 ,v l ), where v % = («i,i>2, ■■■,Vi) and fa is the 
mapping at time i. 

The following Theorem [7] provides the capacity-equivocation region TZ^^ for the model of Figure [2] with causal 
side information and feedback, and it is proved in Appendix [H] 

Theorem 7: A single-letter characterization of the capacity-equivocation region is as follows, 

llM = {{Rq, Rl, Re) ■ < R e < Rl, 

R <I(U;Z), 

Ri<I(K;Y\U), 

R e <H(Y\Z)}, 

where p UKVX Yz(u,k,v,x,y, z) = pz\Y(z\y)p Y \xy(y\x,v)p X \uKv(x\u,k,v)p UK (u,k)p v (v), which implies 
that (U, K) -> (X, V) -> Y -> Z and V is independent of U and K. 
Remark 7: There are some notes on Theorem [7] see the following. 

• The region is convex, and the proof is omitted here. 

• The ranges of the random variables U and K satisfy 

\\U\\ < ||*|| HVII + 1, 
||/C||<(||^||||V|| + 1)||^||||V||. 

The proof is similar to the proof of the second part of Remark [T] and it is omitted here. 

• Without the secrecy parameter R e , the region TZ^ C ^ is exactly the same as the capacity region for the degraded 
broadcast channel with causal side information lfl4l . i.e., the noiseless feedback can not increase the capacity 
of the degraded broadcast channel with causal side information. 

• The points in TZ^ C ^ for which R e = R\ are of considerable interest, which imply H(S) = H(S\Z N ). 
Definition 10: (The secrecy capacity C'i c ^) The secrecy capacity ci c ^ of the model of Figure [2] with causal 
side information and feedback, is denoted by 

Ci c/) = max Ri. (3.16) 

Furthermore, the secrecy capacity C's satisfies 

Ci c/) =maxmin{I(K;Y\U),H{Y\Z)}. (3.17) 
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Proof of (3.17 'I: Substituting R e = Ri and R = into the region TZ^^ 1 in Theorem]?] we have 

Ri <I(K;Y\U), (3.18) 
Ri < H{Y\Z). (3.19) 



By using (3.16 1, (3.18 i and (3.19 1, (3.17l is proved. 



• By using the same formula as ( 3.15 I, it is easy to see that the feedback helps to enlarge the region in 
Theorem [3] 

IV. A Binary Example for the Model of Figure[2]with Causal Side Information and Noiseless 

Feedback 

In this section, we calculate the secrecy capacity of a special case of the model of Figure [2] with causal side 
information and noiseless feedback. 

Suppose that the channel state information V N is available at the channel encoder in a casual manner, and the 
random variable V is uniformly distributed over {0, 1}, i.e., pv(0) — Pv(l) = \- Meanwhile, the random variables 
X, Y and Z take values in {0, 1}, and channel 2 is a BSC (binary symmetric channel) with crossover probability 
q. The transition probability of channel 1 is defined as follows: 

When v = 0, 



Py\x,v(v\x,v = 0) = 



l-p, Ky = x, 

p, otherwise. 



py\x,v(v\x,v = 1) = 



When v = 1, 

p, if y = x, 

1 — p, otherwise. 
From Remark [7] we know that the secrecy capacity for the causal case is 

C { s cf} =maxmin{I(K;Y\U),H{Y\Z)}. 



(4.1) 



(4.2) 



(4.3) 



Since Rq = 0, we have U = const, and therefore, I(K;Y\U) — I(K;Y). The remainder of this example is to 
calculate the characters max I(K; Y) and max H(Y\Z). 

Let K take values in {0, 1}. The probability of K is defined as follows. Pk(0) = a and j»a'(1) =1 — ex.. 

In addition, define the conditional probability mass function Px\k,v as follows. 

Px\kA q \°i 0) = fa, Px\K,v(m 0) = 1 - fSt, p X \ K y(0\0, 1) = 02, Px\k,v(1\0, 1) - 1 - (3 2 , 

PX\K,v(0\h0) = 03, PJT|Jf,v(l|l,0) = 1-03, PX\K,v(P\l,l) =04, VX\K ,V (M h 1) = 1 ~ 

The character I(K\ Y) depends on the joint probability mass functions pky, an d we have 
PKY(k,y) = ^2p K Yxv(k,y,x,v) 

X,V 

= ^2pY\xv(y\x,v)p X \K,v(x\k,v)p K (k)p v (v). (4.4) 

x,v 

Now we calculate meotI(K;Y) and max. H(Y\Z), respectively. 
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(Calculation of max I(K;Y)) Since 

/(^) = ££^(fc,2/)log PK ^ V \ , (4.5) 

V y PK(k)p Y (y) 

and 

Pky(0,0) = -[l-(/3 1 -/3 2 )(l-2p)] ; (4.6) 

Pxr(0,l) = |[l + (/3i-/3 2 )(l-2p)], (4.7) 

Q; 

Pxr(l,0) = ^ [1 - (/?3 - &)(1 - 2p)], (4.8) 

P*y(l, 1) = 2 [1 + (A» - /? 4 )(1 - 2p)]. (4.9) 

Define j3 1 — (3 2 — a and /3 3 — /? 4 = 6, then we have 

/(if; y) < /i(l — a—p+ 2pa) - h{p) < 1 - (4.10) 

where h(x) = —x log x — (1 — .x) log(l — a;), < a; < 1, and the "=" is achieved if a = 1, 6 = — 1 and a = |. 
The character H(Y\Z) is calculated as follows. 



H(Y\Z) = H(Z\Y) + H(Y)-H(Z) 

< H(Z\Y) = h(q), (4.11) 

and therefore, ma,xH(Y\Z) = h(q). Note that a = 1, 6 = — 1 and a = \ are also the distributions for 
H(Y\Z) = h(q). 
Thus, the secrecy capacity of this special case is 

C { s cf) = min{l - h(p), h{q)}. (4.12) 

V. Conclusion 

In this paper, we investigate the model of degraded broadcast channel with side information, confidential messages 
and with or without noiseless feedback. This work is from Steinberg's work on the degraded broadcast channel 
with causal and noncausal side information, and Csiszdr and Korner's work on broadcast channel with confidential 
messages. For the non-feedback model, inner and outer bounds on the capacity-equivocation regions are provided 
for both causal and noncausal manners. Superposition coding, Gel'fand-Pinsker's binning and Wyner's random 
binning technique are used in the corresponding achievability proofs. 

For the feedback model, inner and outer bounds on the capacity-equivocation region are provided for the noncausal 
case, and the capacity-equivocation region is determined for the causal case. In the corresponding achievability 
proofs, the noiseless feedback is used as a secret key shared by receiver 1 and transmitter, and therefore, the coding 
schemes for the achievability proofs are a combination of superposition coding, Gel'fand-Pinsker's binning, block 
Markov coding and Ahlswede-Cai's secret key on the feedback system. 

Finally, we give an example on calculating the secrecy capacity of the binary degraded broadcast channel with 
causal side information, confidential messages and noiseless feedback. 
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Appendix A 
Proof of Theorem[T] 

In this section, we will show that any triple (R , R e ) £ 1Z ni is achievable. Superposition coding, Gel'fand- 
Pinsker's binning and Wyner's random binning technique are used in the construction of the code-books. 



Now the remainder of this section is organized as follows. The code construction is in Subsection A-A The 



proof of achievability is given in Subsection |A-B| 
A. Code Construction 

Since R e < I(K;Y\U) - I(K;Z\U) and R e < R 1 < I(K;Y\U) - I(K;V\U), it is sufficient to show that 
the triple (i? , Ri,R e = I(K; Y\U) - max(J(AT; Z\U),I(K; V\U))) is achievable, and note that this implies that 
Ri > Re = I(K; Y\U) - max(I(K; Z\U),I(K; V\U)). 

Given a triple (Rq, Ri, R e ), choose a joint probability mass function pu,k,v,x,y,z{u, k, v, x, y, z) such that 

< R e < R u 
Ro<I(U;Z)-I(U;V), 
Ri<I(K;Y\U)-I(K;V\U), 
R e = I(K; Y\U) - nmx(I(K; Z\U),I(K; V\U). 
The confidential message set S and the common message set T satisfy the following conditions: 

Jim i log || S ||= Rx = I(K- Y\U) - I(K; V\U) - 7 i, (13) 
lim ilog||r||=iJo = /(^;Z)-/(l7;V)-7 ) (14) 

N— too iV 

where 7 and 71 are fixed positive real numbers and 

< 71 < (a) max(/(if; Z\U),I{K; V\U)) - I(K; V\U). (15) 



Note that (a) is from R x > R e = I(K;Y\U) - max(I(K;Z\U),I(K;V\U)) and ([13). Let S = {1, 2, 2 NRl } 
and T = {1,2, ...,2 NR «}. 
Code-book generation: 

• (Construction of U N ) Gel'fand-Pinsker's binning technique is used in the construction of U N , see Figure [3] 
Generate 2 N ^ II - U ' Z ^~ ei - N ^ (ei t N — >• as N — > 00) i.i.d. sequences u N , according to the probability mass 
function pu(u). Distribute these sequences at random into 2^° = 2 N ( I ( U ' Z * > ~ I ( U ' V ' > ~ 1 ' > bins such that each 
bin contains 2 N( - I{u ' y ^ + '<~ tl - N ^ sequences. Index each bin by i £ {1,2, ...,2^°}. 
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For a given common message t (t £ T) and side information i; , try to find a sequence in bin t 
{u N (t,l),u N [t,2),...,u N (t,2 N W u ' v ^-^> N '>)} that is jointly typical with v N , say u N (t,i*), i.e., 
(u N (t,i*),v N ) £ Tff v {e\). If multiple such sequences in bin t exist, choose the one with the smallest i*. If 
no such i* exists, then declare an encoding error. 



J 2*W:V')-?-«i.»> codewords per bin 

> 2 N ^ u '^- ei ^ codewords of U N 
Number of bins 2 NR ° = 2 *'('(^z>- 



Fig. 3: Code-book construction for U N in Theorem 



(Construction of if 1 *) Classical superposition coding and double binning technique Q are used in the 
construction of K N , see Figure [i] 

For the transmitted sequence u N (t,i*), generate 2 N ( I ( K ' Y \ u " l ~ e2 ' N ' > (e2 at — > as iV — > oo) i.i.d. sequences 
k N , according to the probability mass function PK\u{ki\Ui{t : i*)). Distribute these sequences at random into 
2iVR! = 2 W(K;Y\u)-i(K;V\U)- 11 ) bins such that each bin contains 2 N ( I (K:V\u)+ 11 -e 2 ,N) sequences. Index 

each bin by i e {1, 2, 2 NRl }. Then place the 2 Ar ( / ( K;V l c/ ) +7l ~ e2 < JV ) sequences in every bin randomly 
into 2 N ^ max ^^ K ' V ^' I(K ' Z ^~^ K ' Z ^ +e3 ^ (e 3jA r -> as N ->■ oo) subbins such that every subbin 
contains 2 N ^ K <v\U)+i/i-e 3 , N -x ^(i(K;V\U),l(K-,z\u))+l{K-,z\u)-e 3 , N ) sequences. Let J be the random variable 
to represent the index of the subbin. Index each subbin by 

j £ {i : 2,... 7 2 N ( ma *( I ( K '' V \ u ^ I ( K > z \ u ^~ I ( K ' Z \ u ') +e3 ' N ^}, i.e., 



log \\J\\ = N(max{I(K; V\U),I(K; Z\U)) - I(K; Z\U) + e 3 , N ). 
Here note that the number of the sequences in every subbin is upper bounded as follows. 

I(K; V\U) + 7l - e 2 , N - max(/(X; V\U),I(K; Z\U)) + I(K: Z\U) - e 3 ,jv 
<(») I(K;Z\U)-e 2 ,N-e 3tN , 
where (a) is from {15) . This implies that 



(16) 



lim H(K N \U I \S,J,Z N )=0. 

TV— J-OO 



(17) 



(18) 



For a given confidential message s (s £ S) and side information v N , try to find a sequence k N (u N (t,i*)) in 
bin s such that (k N (u N (t,i*)),v N ) £ T^ v < u (e2). If multiple such sequences in bin s exist, choose the one 
with the smallest index in the bin. If no such sequence exists, declare an encoding error. 



16 



2 N(i(K:ViU)+ yi -^, N ) codewords pe,. b i n 

2J*f(/(KiV|tfH- n -€ a , w -m«(/(K;V|t/>,i(JC(Z|tf»+/(K^|t/)-€ ) , rW ) pe,. subbin 

2 *(™x(J(X;V| t /),i(X;Z|C/))-J(K ; Z|t/) +t ,, w ) ^ ^ 
2 N(J(K;r|[/)- £2 „ v ) codewords of K *i 

Number of bins = ^(m-r\u)-l(K;V\u)-^) 



Fig. 4: Code-book construction for K N in Theorem jlj 



• (Construction of X ) The a; is generated according to a new discrete memoryless channel (DMC) with 
inputs k N , u , v , and output x N . The transition probability of this new DMC is Px\u,k,v(x\u, k, v), which 
is obtained from the joint probability mass function pu,k,v.x,y,z(u, k, v, x, y, z). The probability 
Px N \u N ,k n ,v N (x \u N , k , v ) is calculated as follows. 

N 

Px N \u N ,k n ,v N {x N \u N ,k N ,v N ) = Y\_Px\U,K,v(Xi\Ui,ki,Vi). (19) 

»=i 

Decoding: 

Receiver 2: Given a vector z N £ Z N , try to find a sequence u N (t 7 i) such that (u N (t,i), z N ) £ Tff z (es). If 
there exist sequences with the same t, put out the corresponding t. Otherwise, i.e., if no such sequence exists or 
multiple sequences have different message indices, declare a decoding error. 

Receiver 1: Given a vector y N E y N , try to find a sequence u N such that (u ,y ) £ T^ Y { e 4:)- If suc h a sequence 
does not exist, or there are more than one such sequence, declare a decoding error. Denote the corresponding 
sequence by u N (t, i), put out the corresponding index t. 

After decoding u N (i, i) and i, try to find a sequence k N (u N {i, i)) such that (k N (u N (i, i)), y N ) £ T^ Y , u (e 5 ). If 
there exist sequences with the same index of the bin s, put out the corresponding s. Otherwise, declare a decoding 
error. 

B. Proof of Achievability 

By using the above definitions, it is easy to verify that lirnjv-^oo 1 ° s | r 7 '^ = Ro an d limjy^co log jJ r ' s ^ = R\. 

Then, note that the above encoding and decoding scheme is similar to the one used in lfT4ll . Hence, by similar 
arguments as in lfl4ll . it is easy to show that P e i < e and P e 2 < e, and the proof is omitted here. It remains to 
show that limjv-yoo A > i? e , see the following. 



lim A = lim —H(S\Z N ) 

N^oo N-yoa N 

> lim 1=H(S\Z N ,U N ) 

= lim ~(H(S,Z N ,U N )- H(Z N ,U N )) 

= lim ^-(H(S, Z N , U N , J, K N ) — H(J, K N \Z N , U N , S) — H(Z N , U N )) 

( = } lim -(H(Z N \U N ,K N ) + H(U N , J, K N , S) ~ H(J, K N \Z N ,U N , S) - H(Z N ,U N )) 

iV-^oo N 

( A) lim L( H (Z N \U N ,K N ) + H(U N ,K N )-H(J,K N \Z N ,U N ,S)-H(Z N ,U N )) 
= ii m -(H(K N \U N ) - H(J,K N \Z N ,U N ,S) - I(Z N ;K N \U N )) 

N—tOO N 

= lim ^-(H(K N \U N )-H(J\Z N ,U N ,S)-H{K N \Z N ,U N ,S, J) - I{Z N ;K N \U N )) 

> lim ~(H(K N \U N )-log\\J\\-H(K N \Z N ,U N ,S,J)-I(Z N ;K N \U N )) 

> lim ^-{H(K N \U N ) - H(K N \U N ,Y N ) - log \\J\\ - H{K N \Z N , U N , S, J) ~ I(Z N ; K N \U N )) 
= lim ±-(I(Y N ;K N \U N )~\og\\J\\-H(K N \Z N ,U N ,S,J)-I(Z N ;K N \U N )) 

N—tOO T\ 

( =' lim UNI(Y;K\U) -log\\ J\\-H(K N \Z N ,U N ,S, J) -NI(Z;K\U)) 

N—t-oo iV 

= lim —(NI{Y; K\U) -Nmax(I{K; V\U),I(K; Z\U)) + NI(K; Z\U) - Ne 3 N - NI(Z; K\U)) 

iV— S-OO N 

= I(Y;K\U)-max(I(K;V\U),I(K;Z\U))=R e , (20) 

where (a) is from (S, J) -> (U N ,K N ) ~> Z N , (b) is from H{J, S\U N ,K N ) = 0, (c) is from H(J\Z N , U N , S) < 
H(J) < log 1 1 J7"| | , (d) is from that V N , U , K N and X N are i.i.d. generated random vectors, and the channels 



are discrete memoryless, (e) is from ( 16 1 and ( 18 1, and (f) is from e 3 pj —> as A" — > oo. 



Thus, limAT^oo A > R e is proved, and the proof of Theorem [T] is completed. 

Appendix B 
Proof of Theorem|2] 

In this section, we prove Theorem[2j all the achievable (Rq, R±, R e ) triples are contained in the set TZ^ no \ Suppose 
(Ro, Ri, R e ) is achievable, i.e., for any given e > 0, there exists a channel encoder-decoder (N, A, P e i, P e 2) such 
that 

l im l0g | T 11 = Ro, lim lQg H 511 =R U lim A > R e , P el < e, P e2 < e. 
Then we will show the existence of random variables (U, K, A) —> (X, V) —}Y —> Z such that 

0<Re<Rl, (21) 

Ro<I(U;Z)-I(U;V), (22) 
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Ri < I(K;Y\U,A) - I(K;V\U,A), 
Rq + Ri< I(U, K, A; Y) - I(U, K, A; V), 
R e < I(K; Y\U, A) - I{K; V\U, A) - I(K; Z\U) + I(K; V\U). 



(23) 
(24) 
(25) 



Since S and T are independent and uniformly distributed over S and T, we have H(S) = log || S || and 
H(T) = log || T ||- The formulas (22 1, (23 1, (24i and (25 1 are proved by LemmaJI] see the following. 

Lemma 1: The random vectors Y N , Z N and the random variables S, T, V, U, K, A, Y, Z of Theorem [2] 
satisfy: 



^H(T) < I(U; Z) - /([/; V) + ^(P e2 ), 
^H(S) = jjH(S\T) < I(K; Y\U, A) - I(K; V\U, A) + ^S(P el ) 7 
^H(S, T) < I(U, K, A; Y) - I(U, K, A; V) + ^S(P el ), 



(26) 
(27) 
(28) 



N 



H{S\Z N ) < I(K;Y\U,A)-I(K;V\U,A)-I(K;Z\U)+I(K;V\U) 



1 



1 



+ ^(P el ) + -S(P e2 ) 



N N 

where S(P el ) = h(P el ) + P e ilog(|S x T| - 1) and S(P e2 ) = h(P e2 ) + P e2 log(|T| - 1). Note that h(P el ) 
-P e i logP el - (1 - P el ) log(l - P e i) and h(P e2 ) = -P e2 logP e2 - (1 - P e2 ) log(l - P e2 ). 



(29) 



Substituting H(S) = log || S ||, H{T) = log || T ||, H(S,T) = H(S) + H(T) and §24$ into (|26j), (|27j, <|28j) 
and (29 1, and using the fact that e — >• 0, the formulas ( p2| ), p3j, ( |24| and ( p5] l are obtained. The formula (21 1 is 
from 

R e < lim A= lim —H(S\Z N ) < lim — H(S) = Rl 

It remains to prove Lemma [TJ see the following. 

Proof of Lemma^ The formula ([26]) follows from ([30]), ([34]) and ||44}. The formula ([27]) is from ([31}, ([36]) 



and (|48j>. The formula ([28]) is from ([32]), ((38) and g9J. The formula (|29) is proved by ([33]), ([36), <[39), (gSJ and 
501 



<Part i> We begin with the left parts of the inequalities (26 1, ( |27) , (28 1 and (29 1, see the following. 
Since T — > y w — > Z w is a Markov chain, for the common message T, we have 



1 

N 



H(T) = 



1 

N 
1 



H(T\Z N ) + ±-I(Z N ;T) 



N 



< (a) -<5(P e2 ) + ^/(^;T). 



(30) 



For the confidential message S, we have 



1 

N 



H(S) = 



< 

<(*>) 



±H(S\T) = ±I(S;Y N \T) + ±H(S\Y N ,T) 
±I(S;Y N \T) + ±H(S,T\Y N ) 
±I(S;Y N \T) + ±5(P el ). 



(31) 
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For 5 and T, we have 

1 (i/(5) + H(T)) = iff (5, T) = 1/(5, T;Y N ) + iff (5, T\Y N ) 

<(c) 1/(5,2^) + i*(P e i). (32) 
For the equivocation to the receiver 2, we have 

~H{S\Z N ) = ^H(S\Z N ,T) + ±I(S;T\Z N ) 

= ±H(S\T) i/(5; Z"|T) + ~I{S;T\Z N ) 
= 1/(5; F W |T) + ~H(S\Y N , T) - 1/(5; Z W |T) + 1/(5; T|Z W ) 
< 1/(5; Y N \T) - 1/(5; Z^T) + ^H(S,T\Y N ) + ^H(T\Z N ) 
< {d) ^I(S;Y N \T)-^I(S;Z N \T) + ^6(P el ) + ±5(P e2 ). (33) 
Note that (a), (b), (c) and (d) follow from Fano's inequality. 

<Part ii> By using chain rule, the character I(Z N ;T) in formula ( 30 » can be bounded as follows, 



1 1 N 

w i(z N ;T) = -^/(z^riz- 1 ) 



=i 
i N 

- Jl) n T|Z " 1} " I{Vl '- m + l]) 

i=l 



1 N 

- T, V^lZ*- 1 ) I(Z f , V&IT, Z*- 1 ) - I{Vi\T, Z^\V^) + J(K; Z i_1 \T, V&)) 

i=l 
1 N 

- YXW;T, v^iz*- 1 ) - m, T, z^w^)) 

i=l 

1 N 

- ]T(ff (Z^ 1 ) H{Z i \Z i ~\T, V&) - H{V t ) + H(V t \Z*-\T, V&)) 

i=l 
1 N 



=i 

N 

-(2) 

N 

i=i 

N 

-(3) 

N 

i=l 
N 



i=l 

where formula (1) follows from that Vi(l < i < N) are i.i.d. random variables and they are independent of T, 
formula (2) follows from that 

N N 

^miV&frZ*- 1 ) = J2m; Z'-^V^), (35) 

i=l i=l 

and formula (3) follows from that Vi(l < i < N) are i.i.d. random variables. 
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<Part iii> Using chain rule, the character I(S;Y \T) in formulas (31 1 and (33 1 can be rewritten as follows 



N 



1 N 

I(Y N ;S\T) = -^HY^SIY^.T) 



i=l 
N 



(Q) jj X>( Y - S\Y l -\T) - J(V5; S\V» lt T)) 
1 N 

- J2( T ( Y i> S > V^-\T) - m- VH X \T, Y*- 1 , S) - I(V f , S, r-^^T) 



z (b) 



N ■ 

I(V i ;Y i - 1 \T,S,V£ l )) 

- £(J(Y«; S, V^-\T) - I(Vi; S, Y^W^.T)) 

- Y,(H(S, V&IY*- 1 , T) - H (S, V^Y*- 1 , T, Y) - H(S, Y*" 1 \V» 1: T) 

i=l 

- ^(HiV^lY^ 1 , T) + H(S\Y*-\ T, V&) - HiV&lY*- 1 , T, Y t ) 



N 



-(c) 



i7(S|Y*-\T, Y,V^+i) - H{Y^\V^ +1 ,T) - H{S\V^,T,Y 

HiY^V^T^i) + HiSlV^T^Y*- 1 )) 
1 N 

^(I(V» 1 ;Y i \Y i -\T)-I(Y i - 1 ;V i \V» 1 ,T) + 
7(5; Y\Y l -\ T, V^) - 7(5; V^Y*" 1 , T)) 
- £(7(5; Y|Y*-\ T, V&) - 7(5; V l \Y i ~ 1 , V^^T)) 



ri-l\ 



i=l 
N 



i £(7(5; Y|Y i_1 ,T, Z*" 1 ) - 7(5; V^Y'" 1 , Vfti.T, Z*" 1 )), 



(36) 



where formula (a) follows from Vi(l < i < N) are i.i.d. random variables and they are independent of T, 5, 
formula (b) follows from 

JV N 

]T I(Y; V&IT, Y*-\S) = Y / m-^-^T, S, (37) 



i=l 



formula (c) follows from (35 1, and formula (d) is from the Markov chains Yi — > (Y l 1 ,T, V^ x ) — > Z' 1 1 , Yi — > 
(y^.T.V&.S) -> Z<-\ Vt -> (Y* _1 ,T, V^+i) -> Z''- 1 and -> (Y*" 1 , T, 5) -> Z 1 - 1 . 
<Part iv> Similar to (34i, the character I(S,T;Y N ) in formula (32i can be rewritten as follows, 
1 1 N 

-7(5, T; y") < - £(77^) # (Y|Y>-\ T, 5, V&) - H(Vi) + 77(Vj|Y i_1 ,T, 5, V&)). (38) 
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<Part v> Similar to (36 1, the character I(S; Z N \T) in formula (33 1 can be rewritten as follows, 

N 



l -I{S;Z N \T) = ±<T,I(Z i ;S\Z i -\T) 

i=l 
1 N 

- £(/(z i; s\z*-\t) - i(v i; s\v^,t)) 
i=i 

i N 

- £(I(Z i; 5, V^Z^T) - J(Z i5 K^IT, Z*- 1 , 5) - 7(K; S, Z^IV^T) 



N 
i=l 



1 w 

- £(/(Z t ; 5, V^IZ-ST) - 7(^5, Z^I^T)) 



N 

i=l 



1 * 



N 

i=l 



I{S-Z l \Z*-\T,V l N +1 )-I{S-V l \Z l -\V l N +1 ,T)) 
1 w 

= ^^ (7(5;Zil ^" 1 ' T '^+ l)_7( ' S;yi|z " 1 ' v ' i + l!T)) - (39) 

<Part vi> (single letter) To complete the proof, we introduce a random variable J, which is independent of S, 
T, X N , V N , Y N and Z N . Furthermore, J is uniformly distributed over {1,2, ...,N}. Define 

U=(T,Z J - 1 ,Vf +1 ,J), (40) 

K = S, (41) 

A=(r J - 1 ,J), (42) 

X = Xj,Y = Yj,Z = Zj,V = Vj. (43) 

<Part vii> Then ( |3"4"| > can be rewritten as 
1 1 W 

-7(T;Z*) < -Y,(H(Z i )-H(Z i \Z i -\T,V£ 1 )-H(V i ) + H(V i \Z i -\T,V» 1 » 

i=l 
1 - 

= ]v = z) _ ff (^l^ _1 . r . ^i+i. J = - H (W = i ) + H(V t \Z*-\T, VJl^J = »)) 



»=i 

= ff(Z J |J)-/7(Z J |Z J - 1 ,T,^ 1 ,J)-77(y J |J) + ff(10|Z J - 1 ) T ) ^ 1) J) 
=W H(Zj\J) - H(Zj\Z J -\T, Vf +1 ,J) - ff(Vj) + 77(V>|Z" 7 ~ 1 ,T', Vf +1 , J) 
= H(Z\J) - H(Z\U) - H(V) + H(V\U) 
< H(Z) - H(Z\U) - H(V) + H{V\U) 

= I(U;Z)-I(U;V), (44) 
where (a) follows from the fact that Vj is independent of J. 
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Proof of p(Vj = v,J = i) = p(Vj = v)p(J = i): Since V N is the output of a discrete memoryless source 
Pv{v), we have 

p(Vi = v) = P(V = v). (45) 
From <Part vi>, we know that the random variable J is independent of V N , and therefore, 

P (Vj =v,J = i)= p(Vi =v,J = i)= p(Vi = v)p(J = i) =« P(V = v)p(J = i), (46) 



where (1) follows from ( |45| . 

On the other hand, the probability p(Vj = v) can be calculated as follows, 



N 



N 



p(Vj = v) 



^2p(Vj =v,J = i) = Y,p(Vi — v,J = i) 



i=\ 
N 



A' 



-(b) 



J2P(V = v)p(J: 



i=l 



i=l 



N 



P(V = v)J2p(J = = P(V = v), 



(47) 



where (a) is from that J is independent of V , the formula (b) is from (145 1 



By using ( 46 » and (47 i, it is easy to verify that Vj is independent of J, completing the proof. 



Analogously, ( 36 1 is rewritten as follows, 
1 1 N 

-I(Y N ; S\T) < - J2(I(S; Yi\Y*-\T, V^Z^) - I(S; V^W^T, Z^)) 



i=l 
N 



~ J2(I(S; Y t \Y l ~ l ,T, V^Z*- 1 , J = i) — I(S; V t \Y*~\ v£ ± ,T, Z l ~\ J = »)) 



N 

= I(S; Yj\Y J ~\T, Vj N +v Z J -\J) - I(S; Vj\Y J ~\T, Vj N +1 ,Z j ~\ J) 
=(°) I(K;Y\U,A)-I(K;V\U,A), 
where (a) follows from (|40]) ,(|4T]>, ( |42| ) and ( |43| l. 



(48) 



Similarly, (38i is rewritten as follows, 

1 



N 



I(S, T; Y ) < I(U,K, A; Y) — I[U,K, A; V), 



and d39p can be rewritten as follows, 



N 



I(S; Z N \T) = I(K; Z\U) - I{K; V\U). 



(49) 



(50) 



Substituting (g4j), (|49]>, (|50| into ([30]), ((3T), ((32]) and Lemma [T] is proved. 
The proof of Theorem [2] is completed. 
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Appendix C 

Size Constraint of The Random Variables in Theorem Q] 

By using the support lemma (see [5 1, p.310), it suffices to show that the random variables U and K can be replaced 
by new ones, preserving the Markovity (U, K) — > (X, V) — > Y — > Z and the mutual information I(U ; Z), I(U; V), 
I(K; Y\U), I(K; V\U), I(K; Z\U), and furthermore, the ranges of the new U and K satisfy: ||W|||| < ||Af||||V||+2, 
||/C|| < (||Af||||V|| + 2) 2 . The proof is in the reminder of this section. 

Let 

P = Pxv{x,v). (51) 
Define the following continuous scalar functions of p : 

fxv(p)=Pxv(x,v),f Y (p) = H(Y)J z (p) = H(Z),fy(p) = H(V). 

Since there are ||A'||||V|| — 1 functions of fxv(p)i me tota l number of the continuous scalar functions of p is 
ll*lll|V||+2. 

Let pxv\u = Pr{X — x, V = v\U = u}. With these distributions pxviu = Pr{X = x, V = v\U = u}, we 
have 

pxv(x,v) = ^p{U = u)fxv(Pxv\u), (52) 
ueu 



I(U; Z) = f z {p) -J2p(U = u)fz{p xv \u), (53) 
ueu 

I(U;V) = fvip) -Y.P( U = u )fv(Pxv\u), (54) 

H(Y\U) = J2p(U = v)fy(pxv\u), (55) 
ueu 

According to the support lemma (0, p. 3 10), the random variable U can be replaced by new ones such that the 



new U takes at most ||«V||||V|| + 2 different values and the expressions (52 1, (53 I, (54 1 and (55 i are preserved. 

Once the alphabet of U is fixed, we apply similar arguments to bound the alphabet of K, see the following. 
Define ||Af||||V|| + 2 continuous scalar functions of pxv ■ 

fxv(Pxv)=Pxv(x,v)J Y (p xv ) = H(Y),f z (pxv) = H(Z)J v (p xv ) - H(V), 

where of the functions fxv(Pxv), on ly ll^llll^ll — 1 m ' e t0 be considered. 

For every fixed u, let pxv\K = Pr{X = a;, V = v\K = k}. With these distributions pxv\K, we nave 

Pr{X = x, V = v\U = u} = p r{K = k\U = u}fxv(Pxv\ K ), (56) 

k£K 



I{K- Z\U) = H(Z\U = u)-J2 fz(Pxv\ K )Pr{K = k\U = u}, (57) 
I(K; V\U) = H(V\U = u)-J2 fv(Pxv\ K )Pr{K = k\U = u). (58) 



keK 
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I(K- Y\U) = H(Y\U = u)-J2 fY(Pxv\K)Pr{K = k\U = u}. (59) 

By the support lemma (Q, p.310), for every fixed u, the size of the alphabet of the random variable K can not 
be larger than ||Af||||V|| + 2, and therefore, ||/C|| < (||;f|| ||V|| + 2) 2 is proved. 

Appendix D 
Proof of Theorem[3] 

In this section, we will show that any triple (i?o, Ri,R e ) G TZ ci is achievable. Superposition coding and Wyner's 
random binning techniques are used in the construction of the code-books. 



Now the remainder of this section is organized as follows. The code construction is in Subsection D-A The 



proof of achievability is given in Subsection D-B 



A. Code Construction 

Since R e < I(K;Y\U) - I{K;Z\U) and R e < Ri < I(K;Y\U), it is sufficient to show that the triple 
{R a ,Ri,R e = I(K;Y\U) - I(K;Z\U)) is achievable, and note that this implies that R 1 > R e = I(K;Y\U) - 
I(K;Z\U). 

Given a triple (Rq, Ri, R e ), choose a joint probability mass function pu,k,v.x,y,z(u, k, v, x, y, z) such that 

< R e < Ri, 
Ro<I(U;Z), 
R 1 <I(K;Y\U), 
R e = I(K;Y\U)-I{K;Z\U). 
The confidential message set S and the common message set T satisfy the following conditions: 

lim ibg||5||=iei = J(iif;y|Cr)-7i, (60) 

N^-oo iv 

lim -log || T \\=R Q =I(U;Z)- 1 , (61) 

N— foo iv 

where 7 and 71 are fixed numbers and 7 > 0, 

< 71 < (a) I{K;Z\U). (62) 



Note that (a) is from R x > R e = I(K;Y\U) - I{K;Z\U) and Let S = {1, 2, 2 NRl } and T = 

{1,2,..., 2^o } . 

Code-book generation: 

. (Construction of U N ) 

For a given common message t (t g T), generate a corresponding sequence u N (t) i.i.d. according to the 
probability mass function pjj(u). 
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2^(71 codewords per bin 



2"(HK;Y\u)-^.h) codewords of K N 



Number of bins 2 jVJ?1 = 2 Ar < / ( K : y l t/ > J »> 



Fig. 5: Code-book construction for K in Theorem 



(Construction of K N ) Classical superposition coding and Wyner's random binning technique |fl5ll are used 
in the construction of K N , see Figure [5] 

For the transmitted sequence u N (t), generate 2 N ( I ( K ' Y \ u " > ~ ei - N ' > (e^jv -4 as JV -> oo) i.i.d. sequences 
k N , according to the probability mass function PK\u(ki\ui(t)). Distribute these sequences at random into 
_ 2 N ( I ( K ' Y \ u )~ii) bins such that each bin contains 2 Ar ^ 7l_ei Jv ^ sequences. Index each bin by i £ 
{1,2,. .,2^}. 

Here note that the number of the sequences in every bin is upper bounded as follows. 

7i-ei,iV< (o) I(K;Z\U)-e liN , (63) 



where (a) is from (62 1. This implies that 



lim H(K"\U",S,Z")=0. (64) 

N— >oc 

For a given confidential message s (s £ S), randomly choose a sequence k N (u N (t)) in bin s for transmission. 
• (Construction of X N ) The x N is generated according to a new discrete memoryless channel (DMC) with 
inputs k N , u N , v N , and output x . The transition probability of this new DMC is p x \u,k,v { x \ u i k, v). The 
probability Px n \u n ,k N ,v N (x N \u , k N , v N ) is calculated as follows. 

N 

Px«\u n ,k«,v»{x n \u n ,k N ,v N ) = ^\_Px\u,K,v{xi\ui,ki,Vi). (65) 

Decoding: 

Receiver 2: Given a vector z N £ Z , try to find a sequence u N (t) such that (it (t), z ) £ Tff z (e). If there 
exists a unique sequence, put out the corresponding t. Otherwise, declare a decoding error. 

Receiver 1: Given a vector y N £ y , try to find a sequence u N such that (u , j/ ) € T^ Y ( e )- If suc h a sequence 
does not exist, or there are more than one such sequence, declare a decoding error. Denote the corresponding 
sequence by u N (i), put out the corresponding index t. 

After decoding u N (t) and i, try to find a sequence k N (u N {i)) such that (k N \u N (£)) , y N ') £ T^ Y|[/ (e). If there 
exist sequences with the same index of the bin s, put out the corresponding s. Otherwise, declare a decoding error. 
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B. Proof of Achievability 

By using the above definitions, it is easy to verify that limjv-j-oo 1 ° s j[ r 7 "^ = Rq and limjv-j-oo 1 ° s jj r ' s ^ = R\. 

Then, note that the above encoding and decoding scheme is similar to the one used in fl4l . Hence, by similar 
arguments as in lfl4ll . it is easy to show that P e \ < e and P e 2 < e, and the proof is omitted here. It remains to 
show that limjv-i-oo A > R e , see the following. 



lim A = lim ±H(S\Z N ) 

N->oo N->oo iv 

> lim — H(S\Z N ,U N ) 

- AT^oo N 

= lim ^-(H(S,Z N ,U N )-H(Z N ,U N )) 

= lim -(H(S,Z N ,U N ,K N )-H(K N \Z N ,U N ,S)-H(Z N 1 U N )) 

N—i-oo N 

( = } lim -(H(Z N \U N ,K N )+H(U N ,K N ,S)~H{K N \Z N 1 U N ,S)-H{Z N ,U N )) 
^ l im ±-(H(Z N \U N ,K N ) + H(U N ,K N )-H(K N \Z N ,U N ,S)-H(Z N ,U N )) 

N—too N 

= lim -(H(K N \U N )~H{K N \Z N ,U N 1 S)~I(Z N ;K N \U N )) 

N—tOO _/V 

> lim ^-{H(K N \U N )-H(K N \U N ,Y N )-H(K N \Z N ,U N ,S)-I(Z N ;K N \U N )) 
= lim hl{Y N ;K N \U N )-H(K N \Z N ,U N ,S)-I(Z N ;K N \U N )) 

= lim ±-(NI(Y;K\U) - H(K N \Z N ,U N ,S) - NI(K;Z\U)) 
( = } lim ±(NI(Y;K\U)-NI(K;Z\U)) 

iV— >-oo IV 

= I(Y;K\U)-I{K;Z\U)=R e , (66) 
where (a) is from S ->■ (U N ,K N ) -> Z N , (b) is from H(S\U N ,K N ) = 0, (c) is from that V N , U N , K N and 



X N are i.i.d. generated random vectors, and the channels are discrete memoryless, and (d) is from ( 64 1. 
Thus, limAr_ i . 00 A > R e is proved, and the proof of Theorem [3] is completed. 

Appendix E 
Proof of Theorem|4] 

In this section, we prove Theorem]^ all the achievable (Rq, R±,R e ) triples are contained in the set TZ^ co \ Suppose 
(Ro, Ri, R e ) is achievable, i.e., for any given e > 0, there exists a channel encoder-decoder (N, A, P e i, P e 2) such 
that 

lim bg | J 11 = Ro, lim lQg H 511 =R U lim A > R e , P el < e, P e2 < e. 
Then we will show the existence of random variables (U, K, A) — > (X, V) — » Y — > Z such that 

0<R e <Ri, (67) 
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Ro<I(U;Z), 
Rl < I(K;Y\U), 
Re < I(K;Y\U)-I(K;Z\A). 



(68) 
(69) 
(70) 



The formula d67b is from 



R e < lim A= lim — H(S\Z N ) < lim —H(S)=R 1 . 

- /V-s-oo W^oo N W->oo AT 

Since the model of Figure [2] with causal side information is a special case of the model of Figure [2] with 
noncausal side information, the formulas ( |68] l, ( |69| ) and ( f70| > are obtained from f7T| ), ( f72] > and f73] l, respectively, 
see the following. 

Proof of ( |6<S| : The parameter i?o of ( |68] > can be written as follows, 

ff(T) 



i?o 



lim 



<W lim (^(P e2 ) 



< lim ( — S(P e2 ) 



i=l 

J- - H^il^- 1 , Z*-\T, V&))) 



<W i im( <J( e ) + iJ(Z)-ff(Z|E/)) 



(71) 



where (a) follows from ( 30 » and (34i, and Vi is independent of (Z l 1 ,T,V£[ 1 ), the formula (b) is from the 
definitions U = (T, V/J^, Y" 7-1 , Z J -\J), Z = Zj, and the formula (c) follows from e -> 0. Thus, the proof of 



(68 i is completed. 

Proof of ( |69| : The parameter i?i of ( |69| can be written follows, 

72(5) 



i? 



i — 



lim 

JV-voo A^ 



JY 



^ A m oo ( A7 E ^ ^ ^' + 

<( 3 ) /(ir ; y|E/), 



(72) 



where (1) follows from (31 1, the formula (2) is from (36l and the fact that 7(5; V l \T, V$ l5 Y i-1 , Z 1 - 1 ) = 0, the 
formula (3) is from the definitions U = (T, Vf +X , Y J ~ l , Z J_1 , J), K = S, Y = Yj, ( J is independent of 5, 
T, X N , V N , Y N and Z N , and J is uniformly distributed over {1, 2, A/}) and the fact that e ->• 0. Thus, the 
formula d69]l is proved. ■ 



2S 



Proof of ( |70| : The parameter R e of ( |70| i satisfies 



AT->oo TV 



1 w 



i=l 

/(s^i^-St,^)) + i«y(p el ) + ^«J(p e2 )) 

I(K;Y\U)-I(K;Z\A), (73) 



where (a) follows from ( |33| , the formula (b) is from ( [36] ) and ( |39[ >, and the fact that Vi is independent of 

^Z i - 1 ,Y i - 1 ,T,S,V^ 1 ), the formula (c) is from the definitions U = (T,Vf +v Y J - 1 ,Z J - 1 ,J), K = S, A = 
(T, Vf +1 ,Z J -\j), Y = Yj, Z = Zj, and the fact that P e i,P e2 < e -> 0. Thus, the proof of J70I is completed. 



The proof of Theorem [4] is completed. 

Appendix F 
Proof of Theorem|5] 

In this section, we will show that any triple (i?o, Pi, Re) & lZ n f l is achievable. Superposition coding, Gel'fand- 
Pinsker's binning, block Markov coding and Ahlswede-Cai's secret key on feedback |lj are used in the construction 
of the code-books. In addition, the encoding and decoding scheme for Theorem [5] can be also viewed as a 
combination of Steinberg's method [ 14], block Markov coding and Ahlswede-Cai's secret key on feedback (TJ. 



Now the remainder of this section is organized as follows. The code construction is in Subsection F-A The proof 



of achievability is given in Subsection F-B 



A. Code Construction 

Given a triple (P , P 1; R e ), choose a joint probability mass function Pu,kvxy,z( u i k, v, x, y, z) such that 

< R e < Pi, 
Ro<I(U;Z)-I(U;V), 
Ri<I(K;Y\U)-I(K;V\U), 
Re < H{Y\Z). 

The confidential message set S and the common message set T satisfy the following conditions: 

Jim i log || S ||= P x = I(K- Y\U) - I(K; V\U) - 71, (74) 

lim ^log \\T\\=R = I(U;Z)-I(U;V)- 1 , (75) 

iv— >-oo iv 
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where 7 and 7! are fixed positive real numbers. Let S = {1, 2, 2 NRl } and T = {1, 2, 2^°}. 

We use the block Markov coding method. The random vectors U N , K N , V , X , Y and Z N consist of n 
blocks of length N. The common message for n blocks is T n = (T 1; T n ), which is composed of n i.i.d. random 
variables uniformly distributed over T. The confidential message for n blocks is S n = (S 2 , S3, S n ), where Si 
(2 < i < n) are i.i.d. random variables uniformly distributed over S. Note that in the first block, there is no S\. 

Let Zi (1 < i < n) be the output of channel 2 for block i, Z n = (Z 1: Z n ), = (Z t ,..., Z^ u Z j+ x, Z n ) 
(1 < j < n). Similarly, Y n = (Y\, Y n ), and Y$ (1 < i < n) is the output of channel 1 for block i. The specific 
values of the above random vectors are denoted by lower case letters. 

Code-book generation: 

. (Construction of U N ) 

The construction of U N for each block is exactly the same as that in the proof of Theorem[T] see the following. 
For the i-th block (1 < i < n), generate 2 A '( / ( ,7;Z ) -ei . N ) (e^jv — > as N — > 00) i.i.d. sequences u N , according 
to the probability mass function pu{u). Distribute these sequences at random into 2 NRo = 2 Ar ( / ( c/;Z )~ / ( c/;y )~ 7 ) 
bins such that each bin contains 2 JV ( / ( £/;V ') +7-ei < JV ) sequences. Index each bin by ti G {1,2, ...,2 NR °}. 
For a given common message ^ (ti € T) and side information v N , try to find a sequence in bin ti 
{u N (t i ,l),u N (t i ,2),... : u N (t i ,2 N ( I ( u W + '<- ei - N, >)} that is jointly typical with v N , say u N (UJ*), i.e., 
(u N (U, j*),v N ) € Tjj V (e\). If multiple such sequences in bin ti exist, choose the one with the smallest j*. 
If no such j* exists, then declare an encoding error. 
. (Construction of K N ) 

Superposition coding, Gel'fand-Pinsker's binning, block Markov coding and Ahlswede-Cai's secret key on 
feedback Q are used in the construction of K N , see Figure [6] 

For the transmitted sequence u N (ti,j*), generate 2 Ar ( / (- ftr;Y l l/ ) -C2 - JV ' (fi2,N as N —> 00) i.i.d. sequences 
k N , according to the probability mass function PK\u{^i\ u i)- Distribute these sequences at random into 2 NRl = 
2 N(i(K;Y\u)-i(K;V\U)- 7l ) bins such that each bin contains 2 N ( I (K;V\U)+ 11 -e 2 , N ) seqU ences. Index each bin 

by le {1,2, ..,2^1}. 

In the first block, for a given side information v N , try to find a k N (u N (ti, j*)) generated by u N (ti,j*) such that 
(k N (u N (ti, j*)),v N ) £ ^KVlui 6 )' ^ multiple such sequences exist, randomly choose one for transmission. 
If no such sequence exists, declare an encoding error. 

For the i-th block (2 < i < n), firstly we generate a mapping gt : y N — > S (note that HD^H^ >|| S ||). Define 
a random variable K* = gf(Yi-i) (2 < i < n), which is uniformly distributed over S, and K* is independent 
of Reveal the mapping gf to both receivers and the transmitter. 

Then, when the transmitter receives the output yi-i of the z- 1-th block, he computes k* = £ S. For 

a given Sj, the transmitter chooses a sequence k N (u N (ti, j*)) from the bin Sj © k* (where © is the modulo 
addition over S) such that (k (u N (ti, j*)), v N ) £ T^ v ^ u (e). If multiple such sequences in bin Sj © k* exist, 
choose the one with the smallest index in the bin. If no such sequence exists, declare an encoding error. 
• (Construction of X N ) 
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2 w(/(K- ; v|r/)+7.-^,») codewords per bin 
Number of bins 2 JVH ' = 2W«*^|i0-Jr(*;V|»)-n) 
2 . V( /(K ; y|L0- e2 .,,) codewords ofK A- 

Sj © ft* determines which bin to choose 



Fig. 6: Code-book construction for K N in Theorem [5] where Sj and fc* are the confidential message and the secret 
key for block i, respectively. 



For each block, the a; is generated according to a new discrete memoryless channel (DMC) with inputs k , 
u N , v , and output x N , The transition probability of this new DMC is Px\u,k,v{ x \ u i k, v). The probability 

Px n \u n ,k n ,v n (x N \u N , k N , v N ) is calculated as follows. 

N 

Px N \u N ,k n y N {x N \u N ,k N ,v N ) = Y[px\u,K,v(xi\ui, ki,Vi). (76) 

i=l 

Decoding: 

Receiver 2: For each block, given a vector z N £ Z , try to find a sequence u N (t,i) such that (u (t,i), z N ) £ 
Tff z (es). If there exist sequences with the same t, put out the corresponding t. Otherwise, i.e., if no such sequence 
exists or multiple sequences have different message indices, declare a decoding error. 

Receiver 1: For the first block, given a vector y\ € y , try to find a sequence u N such that (u , y\) £ Tjj Y {ci). 
If such a sequence does not exist, or there are more than one such sequence, declare a decoding error. Denote the 
corresponding sequence by u N (ti,j), and put out the corresponding index t\. 

For block i (2 < i < ri), given a vector yi 6 y , try to find a sequence ti 1 * such that (u N ,yi) £ Tff Y (e4). If 
such a sequence does not exist, or there are more than one such sequence, declare a decoding error. Denote the 
corresponding sequence by u N (ti,j), put out the corresponding index fj. 

After decoding u N (U,j) and ti, try to find a sequence k (u N (tj, j)) such that (k N (u N (ii, j)), yi) £ T^ Y ^ u (e^). 
If there exist sequences with the same index of the bin Sj © fc*, put out the corresponding s, © k*. Otherwise, 
declare a decoding error. Finally, since receiver 1 knows k* (k* = gfj(^_i)), put out the corresponding Sj from 

© /c* . 

B. Proof of Achievability 

The rate of the confidential message S n is defined as 

m = lim lim (77) 

AT-foon-foo 71JV 



31 



and it satisfies 



Rl = lim lim H< " S ^ 



N— too n— >oo nN 



lim lim 



N— >oo n— >oo 7J./V 

= lim lim — 

N— >oo n— >oo niV 

= i?L (78) 
Similarly, the rate of the common messages T n satisfies 

R* = lim lim 

N— s-oo n— >oo niV 

= Ro- (79) 

In addition, note that the encoding and decoding scheme for Theorem [5] is exactly the same as that in fl4l . 
except that the transmitted message for receiver 1 is s © k* . Since receiver 1 knows k* , the decoding scheme for 
Theorem [5] is in fact the same as that in lfl4]l . Hence, we omit the proof of P e \ < e and P e 2 < e here. 

It remains to show that limjv_ i . 00 A > R e , see the following. 

Since the confidential message S is encrypted by S®K*, the equivocation about S is equivalent to the equivocation 
about K* . There are two ways for receiver 2 to obtain the secret key k* . One way is that he tries to guess the k* 
from its alphabet S. The other way is that he tries to guess the feedback y N (y N is the output of channel 1 for 
the previous block, and k* — gf(y N )) from the conditional typical set Xjy| Z ](<5), and this is because for a given 
z N and sufficiently large N, Pr{(y N <£ T^. [Z] (S)} -> 0. Note that there are 2 NH ^ Z '> sequences y N £ (5) 



when TV — >• oo and S —> 0. Therefore, the equivocation about S is min{ " = R\, H(Y \Z)}, and note that 
Ri > R e and H(Y\Z) > R e , then limjv^oo A > R e is obtained. 
The details about the proof are as follows. 

First, we will show that K* © Si is independent of K* and Si, and this is used in the proof of lim^v-i-oo A > R e . 
Since K* is independent of Si (2 < i < n), and all of them are uniformly distributed over S, the fact that 



K* © Si is independent of K* and Si is proved by the following (80 1 and (81 1. 



Pr{K* © S % = a} = Pr{K* © S t = a, K* = k*} 



1 
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Pr{K* ®Si = a, K* = k*} = Pr{ Si = a 9 k* , K* = k* } 

= Pr{ S,; = a k* }Pr{K* = k* } 
1 



ISII 2 ' 



(81) 



Then, limjv^oo A > R e is proved by the following (82i 



lim A = lim lim 



N^roo N^oo n—too nN 

= Km lim i^m^n 

=W lim lim SL 2 g(W,^i) 

N— ¥oo n— >oo nN 

> lim lim ELi^fe^i^Mi) 

N-too n—too nN 

=<» lim lim &^*l^l.*©*?) 





nJV 








nJV 


EH 






71 JV 




:2 min{JV J ff(y|2'),JV J R 1 } 




71 JV 


(n- 


- l)mm{JV#i,JViI(y|Z)} 


nJV 



TV— >-oo n— >oo 

= lim lim 

TV— >-oo n— too 

lim lim 

N—too n— foo 
JV— >oo n— >oo 

= lim lim 

TV— >-co n— too 

= mva{R x ,H{Y\Z)} 

> Re, (82) 

where (a) is from $ -» (Z^Z^) -> Z l ~ 2 , is a Markov chain, (b) is from Si ->■ {Si®K$,Zi-i) -> Z, 

is a Markov chain, (c) follows from the fact that K* © S{ is independent of K* and Zi-\, and (d) is from the fact 
that receiver 2 can guess the specific vector Yi-i (corresponding to the key K*) from the conditional typical set 
Tjy| Z j(5), and K* is uniformly distributed over S (K* is the key used in block i). 
Thus, limjv^oo A > R e is proved, and the proof of Theorem [5] is completed. 

Appendix G 
Proof of Theorem|6] 

In this section, we prove Theorem [H] all the achievable (Rq, R%, R e ) triples are contained in the set TZ^ n ^°\ 
Suppose (Rq, Ri,R e ) is achievable, i.e., for any given e > 0, there exists a channel encoder-decoder (JV, A, P ei , P e2 ) 
such that 

l im lQ S ILL l =R , lim lQg H 511 =R 1 , lim A > R e , P el < e, P e2 < e. 

AT-j-oo JV N^too N N^oo 
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Then we will show the existence of random variables (U, K, A) — > (X, V) — » Y — » Z such that 

0<R e <Ri, (83) 

R <I(U;Z)-I(U;V), (84) 

Ri<I(K;Y\U,A)-I(K;V\U,A), (85) 

R + Ri< I(U, K, A; Y) - I(U, K, A; V), (86) 

R e < H(Y\Z). (87) 
The proof of ( |83"] l, ( [84| , ( |85] l and ( [8o*| are the same as those in the proof of Theorem [2] and therefore, we omit 



it here. It remains to prove (87i, see the following. 



^H{S\Z N ) <W l H (S\Z N ) + ±5(P el )-±H(S\Y N ,Z N ) 



ll(S;Y N \Z N ) + ±S(P el ) 
L H {Y N \Z N ) + ±6(P el ) 

N 



z=l 

1 N 1 

-^ J ff(y i |z i) j = i) + - ( y(p el ) 



=( & ) 



i=i 

= H(Yj\Zj,J) + ±S(P el ) 

=<°> ff(y|z,j) + i^(p el ) 



< 



ff(yfz) + i<y(c), (88) 



where (a) is from Fano's inequality, (b) is from J is independent of Y N and Z" , and (c) is from Y = Yj and 
Z^Zj. 



By using limjv^oo A > i? e , A = j^H{S\Z N ), (88 i and letting e -> 0, the formula (87 1 is proved. 
The proof of Theorem [6] is completed. 

Appendix H 
Proof of Theorem|7] 

A. Converse Part of Theorem [7] 

The converse proof of Theorem [7] is exactly the same as the proof of Theorem |5J except that R e < H(Y\Z). 
Note that the proof of R e < H(Y\Z) is the same as that in the proof of Theorem[6j and hence we omit the proof 
here. 
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B. Direct Part of Theorem [7] 

In this section, we will show that any triple (Rq, Ri, R e ) £ 1Z C J is achievable. 

1) Code Construction: Given a triple (Rq, Ri,R e ), choose a joint probability mass function pu,K,V,x,Y,z(u, k, v, x, 
such that 

< R e < Ri, 
Ro<I(U;Z), 
Ri <I(K;Y\U), 
Re < H(Y\Z). 

The confidential message set S and the common message set T satisfy the following conditions: 

Jim ^logWS^R^IiKiYlU)-^, (89) 

TV— »oo 1\ 

Jim ±-log\\T\\=Ro = I(U;Z)- 7 , (90) 

where 7 and 71 are fixed positive numbers. Let S = {1, 2, 2 NRl } and T = {1, 2, 2^°}. 

We use the block Markov coding method. The random vectors U N , K , V , X^, F w and Z w consist of n 
blocks of length N. The common message for n blocks is T n = (Ti, T n ), which is composed of n i.i.d. random 
variables uniformly distributed over T. The confidential message for n blocks is S n — (S2, S3, S n ), where Si 
(2 < i < n) are i.i.d. random variables uniformly distributed over S. Note that in the first block, there is no Si. 

Let Zi (1 < i < n) be the output of channel 2 for block i, Z n — (Z\, Z n ), Z- 7 = [Z\, Zj-i, Zj+i, Z n ) 
(1 < j < n). Similarly, Y n = (Y{, Y n ), and Yj, (1 < i < n) is the output of channel 1 for block i. The specific 
values of the above random vectors are denoted by lower case letters. 

Code-book generation: 

. (Construction of U N ) 

The construction of U N for each block is exactly the same as that in the proof of Theorem[3] see the following. 
For the i-th block (1 < i < n), given a common message ti (ti £ T), generate a corresponding u N (ti) i.i.d. 
according to the probability mass function pu(u). 
. (Construction of K N ) 

In the first block, for a given u N (ti), generate a corresponding sequence k N i.i.d. according to the probability 
mass function PK\u(ki\ui). 

In the i-th block (2 < i < n), firstly we generate a mapping gj : y N — > S. Define a random variable 
K-i — 5/ O^-i) (2 < « < n), which is uniformly distributed over S, and K* is independent of Si. Reveal the 
mapping gf to both receivers and the transmitter. Then, when the transmitter receives the output of the 
i-l-th block, he computes k* = gf(yi-i) £ S. 
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Given the transmitted sequence u N (ti) and the encrypted confidential message Sjffifc* (where © is the modulo 
addition over S), generate a corresponding sequence k N i.i.d. according to the probability mass function 
PK\u(ki\ui). Index k N by Sj © fe* e S. 
. (Construction of X w ) 

For each block, the x N is generated according to a new discrete memoryless channel (DMC) with inputs fc , 
u w , u , and output x N . The transition probability of this new DMC is Px\U,K,v(x\u, k, v). The probability 
Px N \u N .k n ,v N (x N \u N , k , v ) is calculated as follows. 



Px«\u n ,k«,v«{x n \u n ,k N ,v N ) = Y[px\u,K,v(xi\ui, ki,Vi). (91) 

1=1 

Decoding: 



Receiver 2: For each block, given a vector z £ Z , try to find a sequence u (i) such that (u (f),z ) £ 



^7z( £ 3)- ^ there exists a unique sequence, put out the corresponding t. Otherwise, declare a decoding error. 



Receiver 1: For the first block, given a vector y\ € y , try to find a sequence u N such that (u , yi) € T^y- (64). 
If such a sequence does not exist, or there are more than one such sequence, declare a decoding error. Denote the 
corresponding sequence by u N (ti), put out the corresponding index t%. 

For block i (2 < i < ri), given a vector yi £ y , try to find a sequence ti 1 * such that (u N ,yi) £ T^ Y { € 4)- If 
such a sequence does not exist, or there are more than one such sequence, declare a decoding error. Denote the 
corresponding sequence by u N (ti), put out the corresponding index U. 

After decoding u N (ti) and £j, try to find a sequence k N (u N (ti)) such that (k N (u N (ti)),yi) £ '^ky\u( £5 ')- ^ 
there exists a sequence, put out the corresponding index Si©fc*. Otherwise, declare a decoding error. Finally, since 
receiver 1 knows k* (k* = <?/(yj-i)), put out the corresponding Sj from s, © fc*. 

2) Proof of Achiev ability: Note that the encoding and decoding scheme for Theorem [7] is exactly the same as 
that in [fT4l . except that the transmitted message for receiver 1 is s © k*. Since receiver 1 knows k*, the decoding 
scheme for Theorem [7] is in fact the same as that in [14|. Hence, we omit the proof of P e i < e and P e 2 < e here. 

It remains to show that limyv-i-oo A > R e , see the following. 

Since the confidential message S is encrypted by S®K*, the equivocation about S is equivalent to the equivocation 
about K* . There are two ways for receiver 2 to obtain the secret key k*. One way is that he tries to guess the k* 
from its alphabet S. The other way is that he tries to guess the feedback y N (y N is the output of channel 1 for 
the previous block, and k* = gf(y N )) from the conditional typical set T^^(8), and this is because for a given 
z N and sufficiently large N, Pr{(y N <£ T$, z] (5)} -> 0. Note that there are 2 NH(Y ^ sequences y N £ TE {Z] (S) 
when TV — > 00 and 5—^-0. Therefore, the equivocation about S is min{ log J' s ^ = R 1: H(Y\Z)}, and note that 
Ri > R e and H(Y\Z) > R e , then limjv-i.00 A > R e is obtained. 



The detail about the proof of limTv-s-oo A > i? e is exactly the same as (82i, and it is omitted here. 
Thus, lkujv->-oo A > R e is proved, and the direct part of Theorem [7J is completed. 
The proof of Theorem [7J is completed. 



36 



References 

[1] R. Ahlswede and N. Cai, "Transmission, Identification and Common Randomness Capacities for Wire-Tap Channels with Secure Feedback 
from the Decoder," book chapter in General Theory of Information Transfer and Combinatorics, LNCS 4123, pp. 258-275, Berlin: Springer- 
Verlag, 2006. 

[2] E. Ardestanizadeh, M. Franceschetti, T.Javidi and Y.Kim, "Wiretap channel with secure rate-limited feedback," IEEE Trans Inf Theory, 

vol. IT-55, no. 12, pp. 5353-5361, December 2009. 
[3] Y. Chen, A. J. Han Vinck, "Wiretap channel with side information," IEEE Trans Inf Theory, vol. IT-54, no. 1, pp. 395-402, January 2008. 
[4] I. Csiszdr and J. Korner, "Broadcast channels with confidential messages," IEEE Trans Inf Theory, vol. IT-24, no. 3, pp. 339-348, May 

1978. 

[5] I. Csiszar and J. Korner, Information Theory. Coding Theorems for Discrete Memoryless Systems. London, U.K.: Academic, 1981. 
[6] M. H. M. Costa, "Writing on dirty paper," IEEE Trans Inf Theory, vol. IT-29, no. 3, pp. 439-441, 1983. 

[7] S. I. Gel'fand and M. S. Pinsker, "Coding for channel with random parameters," Problems of Control and Information Theory, vol. 9, no. 
1, pp. 19-31, 1980. 

[8] J. Korner and K. Marton, "General broadcast channels with degraded message sets," IEEE Trans Inf Theory, vol. IT-23, no. 1, pp. 60-64, 
January 1977. 

[9] N. V. Kuznetsov and B. S. Tsybakov, "Coding in memories with defective cells," Problemy peredachi informatsii, vol. 10, no. 2, pp. 52-60, 
1974. 

[10] L. Lai, H. El Gamal and V. Poor, "The wiretap channel with feedback: encryption over the channel," IEEE Trans Inf Theory, vol. IT-54, 
pp. 5059-5067, 2008. 

[11] N. Merhav, "Shannon's secrecy system with informed receivers and its application to systematic coding for wiretapped channels," IEEE 
Trans Inf Theory, special issue on Information-Theoretic Security, vol. IT-54, no. 6, pp. 2723-2734, June 2008. 

[12] C. Mitrpant, A. J. Han Vinck and Y. Luo, "An Achievable Region for the Gaussian Wiretap Channel with Side Information," IEEE Trans 
Inf Theory, vol. IT-52, no. 5, pp. 2181-2190, 2006. 

[13] C. E. Shannon, "Channels with side information at the transmitter," IBM Journal Research and Development, vol. 2, pp. 289-293, October 
1958. 

[14] Y. Steinberg, "Coding for the degraded broadcast channel with random parameters, with causal and noncausal side information," IEEE 

Trans Inf Theory, vol. IT-51, no. 8, pp. 2867-2877, August 2005. 
[15] A. D. Wyner, "The wire-tap channel," The Bell System Technical Journal, vol. 54, no. 8, pp. 1355-1387, 1975. 



